Creating Knowledge Repositories from Biomedical Reports: The MEDSYNDIKATE Text Mining System

نویسندگان

  • Udo Hahn
  • Martin Romacker
  • Stefan Schulz
چکیده

MEDSYNDIKATE is a natural language processor for automatically acquiring knowledge from medical finding reports. The content of these documents is transferred to formal representation structures which constitute a corresponding text knowledge base. The system architecture integrates requirements from the analysis of single sentences, as well as those of referentially linked sentences forming cohesive texts. The strong demands MEDSYNDIKATE poses to the availability of expressive knowledge sources are accounted for by two alternative approaches to (semi)automatic ontology engineering. We also present data for the knowledge extraction performance of MEDSYNDIKATE for three major syntactic patterns in medical documents.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MedSynDikate - a natural language system for the extraction of medical information from findings reports

MEDSYNDIKATE is a natural language processor, which automatically acquires medical information from findings reports. In the course of text analysis their contents is transferred to conceptual representation structures, which constitute a corresponding text knowledge base. MEDSYNDIKATE is particularly adapted to deal properly with text structures, such as various forms of anaphoric reference re...

متن کامل

Chapter 3 Lexical, terminological and ontological resources for biological text mining

Biomedical terminologies and ontologies are frequently described as enabling resources in text mining systems [e.g., 1, 2, 3]. These resources are used to supports tasks such as entity recognition (i.e., the identification of biomedical entities in text) and relation extraction (i.e., the identification of relationships among biomedical entities). Although a significant part of current text min...

متن کامل

Intelligent Approaches to Mining the Primary Research Literature: Techniques, Systems, and Examples

In this chapter, we describe how creating knowledge bases from the primary biomedical literature is formally equivalent to the process of performing a literature review or a ‘research synthesis’. We describe a principled approach to partitioning the research literature according to the different types of experiments performed by researchers and how knowledge engineering approaches must be caref...

متن کامل

Recent progress in automatically extracting information from the pharmacogenomic literature.

The biomedical literature holds our understanding of pharmacogenomics, but it is dispersed across many journals. In order to integrate our knowledge, connect important facts across publications and generate new hypotheses we must organize and encode the contents of the literature. By creating databases of structured pharmocogenomic knowledge, we can make the value of the literature much greater...

متن کامل

Lexical, Terminological, and Ontological Resources for Biological Text Mining

Biomedical terminologies and ontologies are frequently described as enabling resources in text mining systems [1–3]. These resources are used to support tasks such as entity recognition (i.e., the identification of biomedical entities in text), and relation extraction (i.e., the identification of relationships among biomedical entities). Although a significant part of current text mining effort...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

دوره   شماره 

صفحات  -

تاریخ انتشار 2002